Biological Domain Identification Based in Codon Usage by Means of Rule and Tree Induction
نویسندگان
چکیده
There are three domains in living nature: archaea, bacteria and eukarya. It has been shown, trough a number of multivariate tools, that codon usage, a 64 dimensional vector that stablishes how often a given organism makes use of each codon, is related to domain. Another method is proposed here based in rule and tree induction from codon usage of several organisms. It is shown that domain can be identified trough codon usage and a simple set of rules. Two methods were applied, CN2 and C4.5. Obtained rules describe data better than other methods, in the sense that are topological interpretable and have phenomenological meaning.
منابع مشابه
Identification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene
Background: Little knowledge of synonymous codon usage pattern of pseudorabies virus (PRV) genome, especially the UL31 gene in the process for its evolution is available. Objectives: In the present study, the codon usage bias between PRV UL31 sequence and the UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...
متن کاملApplication of the rule extraction method to evaluate seismicity of Iran
Assessing seismic hazards involves specifying the likelihood, magnitude and location of earthquakes in a region. Predicting the seismic hazards is the first step in reducing the impact of the damage caused by an earthquake. In this study, to fully utilize all the known parameters which may possibly affect the occurrence of earthquakes (mb ≥ 4.5); a data-driven rule-extraction method called the...
متن کاملBioinformatics Comparison of Codon Usage of Genes Encoding Phosphate Transporter in Terms of Salt Tolerance, Day Length, Temperature and Pollination in Different Plants
In order to study and compare the phosphate transporter gene codon usage and it's respond to the traits like salt tolerance, day length, Pollination and temperature in different plants, 100 isoform from 10 plants are extracted from NCBI website and then analyzed with Gene Infinity and Minitab 16 software. The result shows that the highest codon usage similarity (81.95%) was for wheat a...
متن کاملBioinformatics Comparison of Codon Usage of Genes Encoding Phosphate Transporter in Terms of Salt Tolerance, Day Length, Temperature and Pollination in Different Plants
In order to study and compare the phosphate transporter gene codon usage and it's respond to the traits like salt tolerance, day length, Pollination and temperature in different plants, 100 isoform from 10 plants are extracted from NCBI website and then analyzed with Gene Infinity and Minitab 16 software. The result shows that the highest codon usage similarity (81.95%) was for wheat a...
متن کاملMMDT: Multi-Objective Memetic Rule Learning from Decision Tree
In this article, a Multi-Objective Memetic Algorithm (MA) for rule learning is proposed. Prediction accuracy and interpretation are two measures that conflict with each other. In this approach, we consider accuracy and interpretation of rules sets. Additionally, individual classifiers face other problems such as huge sizes, high dimensionality and imbalance classes’ distribution data sets. This...
متن کامل